74 research outputs found
DarkHorse: a method for genome-wide prediction of horizontal gene transfer
A new approach to rapid, genome-wide identification and ranking of horizontal transfer candidate proteins is presented. The method is quantitative, reproducible, and computationally undemanding. It can be combined with genomic signature and/or phylogenetic tree-building procedures to improve accuracy and efficiency. The method is also useful for retrospective assessments of horizontal transfer prediction reliability, recognizing orthologous sequences that may have been previously overlooked or unavailable. These features are demonstrated in bacterial, archaeal, and eukaryotic examples
Predicting N-terminal myristoylation sites in plant proteins
BACKGROUND: N-terminal myristoylation plays a vital role in membrane targeting and signal transduction in plant responses to environmental stress. Although N-myristoyltransferase enzymatic function is conserved across plant, animal, and fungal kingdoms, exact substrate specificities vary, making it difficult to predict protein myristoylation accurately within specific taxonomic groups. RESULTS: A new method for predicting N-terminal myristoylation sites specifically in plants has been developed and statistically tested for sensitivity, specificity, and robustness. Compared to previously available methods, the new model is both more sensitive in detecting known positives, and more selective in avoiding false positives. Scores of myristoylated and non-myristoylated proteins are more widely separated than with other methods, greatly reducing ambiguity and the number of sequences giving intermediate, uninformative results. The prediction model is available at . CONCLUSION: Superior performance of the new model is due to the selection of a plant-specific training set, covering 266 unique sequence examples from 40 different species, the use of a probability-based hidden Markov model to obtain predictive scores, and a threshold cutoff value chosen to provide maximum positive-negative discrimination. The new model has been used to predict 589 plant proteins likely to contain N-terminal myristoylation signals, and to analyze the functional families in which these proteins occur
Recommended from our members
Multi-Omic Profiling of Melophlus Sponges Reveals Diverse Metabolomic and Microbiome Architectures that Are Non-overlapping with Ecological Neighbors.
Marine sponge holobionts, defined as filter-feeding sponge hosts together with their associated microbiomes, are prolific sources of natural products. The inventory of natural products that have been isolated from marine sponges is extensive. Here, using untargeted mass spectrometry, we demonstrate that sponges harbor a far greater diversity of low-abundance natural products that have evaded discovery. While these low-abundance natural products may not be feasible to isolate, insights into their chemical structures can be gleaned by careful curation of mass fragmentation spectra. Sponges are also some of the most complex, multi-organismal holobiont communities in the oceans. We overlay sponge metabolomes with their microbiome structures and detailed metagenomic characterization to discover candidate gene clusters that encode production of sponge-derived natural products. The multi-omic profiling strategy for sponges that we describe here enables quantitative comparison of sponge metabolomes and microbiomes to address, among other questions, the ecological relevance of sponge natural products and for the phylochemical assignment of previously undescribed sponge identities
Xenorhodopsins, an enigmatic new class of microbial rhodopsins horizontally transferred between archaea and bacteria
Based on unique, coherent properties of phylogenetic analysis, key amino acid substitutions and structural modeling, we have identified a new class of unusual microbial rhodopsins related to the Anabaena sensory rhodopsin (ASR) protein, including multiple homologs not previously recognized. We propose the name xenorhodopsin for this class, reflecting a taxonomically diverse membership spanning five different Bacterial phyla as well as the Euryarchaeotal class Nanohaloarchaea. The patchy phylogenetic distribution of xenorhodopsin homologs is consistent with historical dissemination through horizontal gene transfer. Shared characteristics of xenorhodopsin-containing microbes include the absence of flagellar motility and isolation from high light habitats
Distinctive Archaeal Composition of an Artisanal Crystallizer Pond and Functional Insights Into Salt-Saturated Hypersaline Environment Adaptation
Hypersaline environments represent some of the most challenging settings for life on Earth. Extremely halophilic microorganisms have been selected to colonize and thrive in these extreme environments by virtue of a broad spectrum of adaptations to counter high salinity and osmotic stress. Although there is substantial data on microbial taxonomic diversity in these challenging ecosystems and their primary osmoadaptation mechanisms, less is known about how hypersaline environments shape the genomes of microbial inhabitants at the functional level. In this study, we analyzed the microbial communities in five ponds along the discontinuous salinity gradient from brackish to salt-saturated environments and sequenced the metagenome of the salt (halite) precipitation pond in the artisanal Cáhuil Solar Saltern system. We combined field measurements with spectrophotometric pigment analysis and flow cytometry to characterize the microbial ecology of the pond ecosystems, including primary producers and applied metagenomic sequencing for analysis of archaeal and bacterial taxonomic diversity of the salt crystallizer harvest pond. Comparative metagenomic analysis of the Cáhuil salt crystallizer pond against microbial communities from other salt-saturated aquatic environments revealed a dominance of the archaeal genus Halorubrum and showed an unexpectedly low abundance of Haloquadratum in the Cáhuil system. Functional comparison of 26 hypersaline microbial metagenomes revealed a high proportion of sequences associated with nucleotide excision repair, helicases, replication and restriction-methylation systems in all of them. Moreover, we found distinctive functional signatures between the microbial communities from salt-saturated (>30% [w/v] total salinity) compared to sub-saturated hypersaline environments mainly due to a higher representation of sequences related to replication, recombination and DNA repair in the former. The current study expands our understanding of the diversity and distribution of halophilic microbial populations inhabiting salt-saturated habitats and the functional attributes that sustain them
The Natural Product Domain Seeker NaPDoS: A Phylogeny Based Bioinformatic Tool to Classify Secondary Metabolite Gene Diversity
New bioinformatic tools are needed to analyze the growing volume of DNA sequence data. This is especially true in the case of secondary metabolite biosynthesis, where the highly repetitive nature of the associated genes creates major challenges for accurate sequence assembly and analysis. Here we introduce the web tool Natural Product Domain Seeker (NaPDoS), which provides an automated method to assess the secondary metabolite biosynthetic gene diversity and novelty of strains or environments. NaPDoS analyses are based on the phylogenetic relationships of sequence tags derived from polyketide synthase (PKS) and non-ribosomal peptide synthetase (NRPS) genes, respectively. The sequence tags correspond to PKS-derived ketosynthase domains and NRPS-derived condensation domains and are compared to an internal database of experimentally characterized biosynthetic genes. NaPDoS provides a rapid mechanism to extract and classify ketosynthase and condensation domains from PCR products, genomes, and metagenomic datasets. Close database matches provide a mechanism to infer the generalized structures of secondary metabolites while new phylogenetic lineages provide targets for the discovery of new enzyme architectures or mechanisms of secondary metabolite assembly. Here we outline the main features of NaPDoS and test it on four draft genome sequences and two metagenomic datasets. The results provide a rapid method to assess secondary metabolite biosynthetic gene diversity and richness in organisms or environments and a mechanism to identify genes that may be associated with uncharacterized biochemistry
- …